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Abstract. We consider the set S(n, 0) of monic complex polynomials of de- 
gree n > 2 having all their zeros in the closed unit disk and vanishing at 0. For 
p G S(n, 0) we let |p|o denote the distance from the origin to the zero set of p' . 
We determine all 0-maximal polynomials of degree n, that is, all polynomials 
p 6 S(n,0) such that \p\o > |q|o for any q G S(n,0). Using a second order 
variational method we then show that although some of these polynomials are 
inextensible, they are not necessarily locally maximal for Sendov's conjecture. 
This invalidates the recently claimed proofs of the conjectures of Sendov and 
Smale and shows that the method used in these proofs can only lead to (already 
known) partial results. In the second part of the paper we obtain a character- 
ization of the critical points of a complex polynomial by means of multivariate 
majorization relations. We also propose an operator theoretical approach to 
Sendov's conjecture, which we formulate in terms of the spectral variation of a 
normal operator and its compression to the orthogonal complement of a trace 
vector. Using a theorem of Gauss-Lucas type for normal operators, we relate 
the problem of locating the critical points of complex polynomials to the more 
general problem of describing the relationships between the spectra of normal 
matrices and the spectra of their principal submatrices. 



Introduction 

Let S n be the set of all monic complex polynomials of degree n > 2 having all 
their zeros in the closed unit disk D. If p £ S n and a £ Z(p) then the Gauss-Lucas 
theorem implies that (a + 2D) n Z(p') ^ 0, where Z(p) and Z{p') denote the zero 
sets of p and p' , respectively. In 1958 Sendov conjectured that this result may be 
substantially improved in the following way: 

Conjecture 1. If p £ S n and a £ Z(p) then (a + D) H Z(j>') ^ 0. 

Sendov's conjecture is widely regarded as one of the main challenges in the 
analytic theory of polynomials. Numerous attempts to verify this conjecture have 
led to over 80 papers, but have met with limited success. We refer to [T3], [201 and 
[2"T] for surveys of the results on Sendov's conjecture and related questions. 

The set P n of monic complex polynomials of degree n may be viewed as a metric 
space by identifying it with the quotient of C" by the action of the symmetric group 
on n elements S n . Indeed, let r : C" — > C n /E ra denote the orbit map. Let further 
p(z) = Jl"=i( z ~ Zi ) ano - = n"=i( z — C») be arbitrary polynomials in P n and 
set 

A(p,q)= min max | m - Oj) I • 

ctGSti l<2<n 

Then A is a distance function on P n which induces a structure of compact metric 
space on the set S n = {p £ P n A(p,z n ) < 1} = r(D n ). Conjecture Q] is therefore 
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an extremum problem in the closed unit ball in P n for the function d given by 
d : S n — > [0, 2], p i — ► d(p) = max min \z — w\. 

z<£Z(p) wGZ(p') 

Note that d(p) is the same as the so-called directed (or oriented) Hausdorff distance 
from Z(p) to Z(p') (cf. [20 J. Since d is obviously a continuous function it follows 
by compactness that there exists p G S n such that d{p) = swp qeSn d(q). A polyno- 
mial with this property is called extremal for Sendov's conjecture. In 1972 Phelps 
and Rodriguez proposed the following strengthened form of Sendov's conjecture 

(cf. my- 

Conjecture 2. If p G S n is extremal for Sendov's conjecture then p(z) = z n + e tS 
for some 9 £ R. 

A proof of Conjecture [5] and thereby of Sendov's conjecture was recently claimed 
in [17]. There are currently eight different versions of [T7J, which we shall refer 
to as [T7J vk], 1 < k < 8. The method employed in loc. cit. consists in studying 
the dynamics of the zeros of the derivative of a polynomial in S n under certain 
perturbations of the zeros of the polynomial itself. Using these perturbations, a 
notion of extensible polynomial is defined and what is actually claimed in [171 vl- 
v2] is that a polynomial in S n is extensible unless it vanishes only on the unit 
circle. If true, such a result would imply that the polynomials in Conjecture [2] are 
not only all the extremal polynomials for Sendov's conjecture but also that they 
are in fact all the local maxima for the function d. The arguments used in [17[ 
vl-v2] were subsequently modified or replaced by completely new ones in [171 v3- 
v8]. Keeping track of so many changes and versions is both time-consuming and 
technically challenging. Unfortunately, this development has been rather confusing 
and has led some to believe that Schmieder's proof is correct or that his method 
would eventually work after some minor adjustments. The first main objective of 
this paper is to show that this is most definitely not the case. In section 1 we make 
a detailed analysis of first order variational methods in general and Schmieder's 
method in particular. We produce concrete counterexamples to the main claims in 
all eight versions of [T7J (see Theorem 11.21 and Propositions II ,2H1 .4|) . Moreover, in 
sections 1.3 and 2.2 we show that Schmieder's approach - or indeed any approach 
based exclusively on first order variational methods - cannot be successful. We also 
argue that all eight versions of the proof of Smale's mean value conjecture claimed 
in [TH] fail for similar reasons (section 1.4). 

In spite of the failure of Schmieder's approach, variational methods remain a 
natural way of dealing with Sendov's conjecture. The relatively few properties 
which are known to hold for locally maximal or indeed even extremal polynomials 
were all deduced by using such methods (see PQ, [8], [S], [IS])- In this spirit, Miller 
proposed a slightly more general extremal problem in [5] and [9]. Let (3 £ D and 
denote by S(n, (3) the set of all polynomials in S n which have at least one zero at 
(3. For a £ C and p £ S(n, (3) let 

\p\ a = min \a — w\ 

and define the a- critical circle to be the circle with center a and radius \p\ a - If 
p £ S(n, (3) is such that \p\ a > |g| a for any q e S(n, f3) then p is said to be maximal 
with respect to a in S{n,(3). For the sake of simplicity, maximal polynomials with 
respect to a in S(n, a), a G D, will be called a-maximal throughout this paper. A 
compactness argument similar to the one used for Conjecture[T]shows that maximal 
polynomials do exist for any a £ C and (3 £ D (cf. [5] Proposition 2.3]). In 1984 
Miller made the following conjecture: 
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Conjecture 3. If p G S(n,(3) is maximal with respect to a then all the zeros of p' 
lie on the a-critical circle and, given this, as many zeros of p as possible lie on the 
unit circle. 

For results pertaining to Miller's conjecture we refer to pQ, [5], [§] and [TS]. The 
relevance of a-maximal polynomials in this context is quite clear. As shown in 
Proposition 2.4], if p G S n is an extremal polynomial for Sendov's conjecture then 
there exists a G Z{p) such that p is a-maximal and \p\ a = d(p). A priori there 
may exist a G D such that if p is an a-maximal polynomial then \p\ a < d(p). The 
a-maximal polynomials that satisfy \p\ a — d{p) are particularly interesting not only 
because all extremal polynomials are necessarily of this type - as already mentioned 
above - but also because they may provide potential candidates for local maxima 
for the function d. It has been known for quite some time now that if |a| = 1 then 
z n — a™ is the only a-maximal polynomial (cf. [15j). Moreover, this polynomial was 
shown to be a local maximum for d ([TU], [27] )■ These are in fact all the examples of 
a-maximal polynomials known so far, as no such polynomials were found explicitly 
for |a| < 1. In section 2.1 we determine all 0-maximal polynomials (Theorem I2.1j) 
and study their properties. It turns out that all these polynomials satisfy Miller's 
conjecture as well as the relation \p\ — d{p). We next consider a special class of 
0-maximal polynomials, namely rotations of the polynomial p{z) = z n + z. If n > 4 
then p is locally maximal for a large class of variations of its zeros (Proposition ! 1.3[) . 
Furthermore, p is locally maximal for the restriction of the function d to S(n, 0) 
and it is also inextensible with respect to 0. These properties and the symmetrical 
distribution of the zeros and critical points of p seem to suggest that if n > 4 then 
p and its rotations could in fact be locally maximal for Sendov's conjecture. The 
discussion in section 1 shows that first order variational methods are not enough 
for deciding whether this is true or not. In section 2.2 we use a second order 
variational method to prove that - contrary to what one might expect from the 
aforementioned properties - the polynomial p is not locally maximal for Sendov's 
conjecture. Indeed, Theorems 12.31 and 12.41 show that if n = 4 or 5 then p is a kind 
of inflection point for the function d. We conjecture that the same is actually true 
for all degrees and also that polynomials of the form z n + e l9 , 8 G R, are in fact all 
the local maxima for d (Conjecture HJ. These results complement those obtained 
in section 1 and show quite clearly that the methods used in |17j cannot provide 
successful ways of dealing with Sendov's conjecture in its full generality. 

So far, almost all the results on Sendov's conjecture and related questions were 
obtained by analytical arguments. As pointed out in pQ, the fact that dor fails to 
be a (logarithmically) plurisubharmonic function in the polydisk D n accounts for 
many of the difficulties in studying locally maximal polynomials for Sendov's conjec- 
ture. On the other hand, the geometrical information contained in the Gauss-Lucas 
theorem is hardly sufficient for dealing with Conjectures [1] and [51 This is mainly 
because of the implicit nature of the relations between the zeros and critical points 
of complex polynomials. Describing these relations geometrically and as explicitly 
as possible would be helpful for a great many questions in the analytic theory of 
polynomials. In section 3 we propose an operator theoretical interpretation and 
approach to Conjectures Q] and [2] (Conjecture [5]). Moreover, we show that these 
conjectures may be viewed as part of the more general problem of describing the 
relationships between the spectra of normal matrices and the spectra of their prin- 
cipal submatrices (Problem [3]). We also give a geometrical characterization of the 
critical points of complex polynomials by means of multivariate majorization rela- 
tions. These results use a combination of operator theoretical tools and methods of 
majorization theory. Such methods were the key to Pereira's recent solutions to the 
1947 conjecture of de Bruijn and Springer and the 1986 conjecture of Schoenberg. 
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Similar ideas were used by Malamud in [BJ, where he not only proved these same 
two conjectures - independently and almost at the same time as Pereira - but he 
also obtained a remarkable generalization of the de Bruijn-Springer conjecture (see 
section 3.2). The methods of [BJ and [H] seem to be particularly well suited for 
studying extremal problems for which the loci of the zeros of extremal polynomials 
are (conjectured to be) lines in the complex plane. We believe that the results and 
the setting developed in section 3 should prove useful for investigating the "spec- 
tral form" of Sendov's conjecture (Conjecture E]) as well as geometrical properties 
of the spectra of normal matrices and their degeneracy one principal submatrices 
(Problem©. 

1. First order variational methods and the conjectures 
of Sendov and Smale 

Variational methods are a natural approach to both Sendov's conjecture and 
Smale's mean value conjecture. As a matter of fact, most of the results concerning 
the general cases of these conjectures were obtained by such methods (see, e. g., p], 
EH, HO], [IS] j US], [3BJ, [57]). In this section we make a detailed analysis of the 
proofs of these two conjectures that were recently claimed by Schmieder in [TTJ and 
[TS] , In particular, we show that all sixteen versions of [TTJ and [TH] are incorrect 
and also that Schmieder's approach - or indeed any approach based exclusively on 
first order variational methods - cannot be successful. 

1.1. Some inextensible polynomials. The notion of extensible polynomial in- 
troduced in [T71 vl-v2] amounts to a solvability condition for a certain system of 
inequalities. This system is linear in the generic case when both the polynomial 
and its derivative have only simple zeros. The main claim in |17l vl-v2] is that 
a polynomial in S n is extensible unless it vanishes only on the unit circle. In this 
section we produce counterexamples to this claim for all degrees in the generic case. 
We also show that in most of the non-generic cases the system of inequalities men- 
tioned above is not linear. Thus, as it stands in [171 vl-v2], the notion of extensible 
polynomial is not well defined in these non-generic cases. 

Although the following definition of local maximality may seem obvious, it is 
actually quite different from the one used in [17] . 

Definition 1.1. A polynomial p 6 S n is called locally maximal for Sendov's con- 
jecture if it is a local maximum for the function d, i. e., if there exists e > such 
that for any q S S n satisfying A(q,p) < e one has d(q) < d(p). 

For p € S n we shall use the following notations: 

n n— 1 

p(z) = Y[( z - z i), a = zi, \p\a=d(p), p'(z)=nY[(z-w j ), 

i=l 3=1 I 1 " 1 ) 

\wj — a\ = \p\ a for 1 < j < r, \iVj — a\ > \p\ a for j > r + 1. 
Let hi, . . . , h n £ D, t G [0, 1[, and set 

Zi(t) = Zi(t, hi,..., hn) = 7^ + - Zl , 1 < i < n, 

1 + thiZi 

n n-1 (1-2) 

q(z) = q(z;t,h 1 , . . .,h n ) = J|(z - Zi(t)), q'{z) = n J| (z - Wj(t)). 

i=i 3=1 

Note that 5 e S n , Zj(0) = Zi, 1 < i < n, and Wj(0) = Wj, 1 < j < n — 1. Let us 
assume for now that 

p and p have only simple zeros. (1-3) 
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By the implicit function theorem the curves Wj(t), 1 < j < n— 1, are differentiablc 
in a neighborhood of 0. Straightforward computations lead to the following result 
(cf. [m v2, Lemma 1]): 

Proposition 1.1. For all sufficiently small t > and 1 < j < r one has 



j (t)-z 1 (t)\ = \p\ a {l + n 



.i=i 



(ai(wj) + bi{wj) 



t + Oit 2 ) 



wht 



a x {w j ) 
Oi(Wj) = — 



p(Wj 



(wj — a) 2 p"(wj) 



2 < i < n, 



(Wj - a)(wj - Zi) 2 p"(wjY 
and bi(wj) = —zfa,i(wj), 1 < i < n. 

Remark 1.1. The notation bi(wj) was used in [T71 vl-v2]. The coefficients bi(wj) 
were subsequently denoted by a*(wj) in [17, v3-v5]. 

From Proposition 1 1 . 1 1 we see that if there exist hi, . . . , h n G D such that 



E 

i=l 



> 0, 1 < j < r, 



(1.4) 



then d(q) > \q\ Zl (t) — mini<j< r \wj(t) — Zi(t)\ > \p\ a — d(p) for all small t > and 
thus p cannot be a local maximum for d. Define the following r x n matrix: 

B(p) = (fiij), where /3y = aj(w t ) + bj(wi), 1 < i < r, 1 < j < n, (1.5) 

and set h = (ftj, . . . ,h n ). Then (| 1 .4|) may be written as 



3? (b(p)I 



h)>0. 



(1.6) 



From the above discussion one concludes that if system (|1.6p has a solution ft, e C" 
then the polynomial p cannot be a local maximum for the function d. The following 
definition is essentially the same as Definition 2 in [T71 v2]. 

Definition 1.2. The polynomial p is extensible with respect to its zero a if system 
(|1.6p is solvable. Otherwise, the polynomial p is said to be inextensible. 

In order to see what Definition 11.21 means in terms of the coefficients of B(p) let 
us first recall [H Definition 2.19]. 

Definition 1.3. A complex m x n matrix M = (m^) is positively singular if there 
exist hi, . . . , ji m > 0, not all 0, so that J2iLi ^^ij = for 1 < j < n. 

The next result is a generalization of what is usually called the fundamental 
theorem of linear programming and may be found in [141 Theorem 22.2] (see also 
[1 Lemma 2.20]). 

Theorem 1.1. Let M be a complex m x n matrix and let zG C" denote a vector 
of complex unknowns. The system (Mi) > has no solution if and only if M is 
positively singular. 

From Definitions 1 1 . 214T731 and Theorem ll.ll we see that p is extensible with respect 
to its zero a if and only if B(p) is not positively singular. The raison d'etre of 
Definition 11.21 is now quite clear: extensible polynomials cannot be local maxima 
for the function d. Actually, this same necessary criterion for local maximality 
has already been obtained and used by Miller in [H] . Note though that inextensible 
polynomials need not be local maxima for d either, as one can see from the following 
simple example. 
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Lemma 1.1. If 6 £ R then the polynomial p(z) = z 3 + e l6 z is inextensible with 
respect to but it is not a local maximum for d. 

Proof. The fact that the polynomial p is not extensible with respect to is a special 
case of Theorem 11.21 below. Note that d(p) — \p\o = and that by a rotation we 
may assume that p(z) = z 3 — z. Let t £ [0, 1] and set 

q t (z) = (z - it)(z 2 -1). 

Then q t £ S3 and elementary computations show that for all small t > one has 



l + t 2 

d(qt) = \qt\it = \J—^— > d (p)> 
which proves the lemma. □ 

Remark 1.2. In section 2.2 we shall construct explicit examples of inextensible 
polynomials of degree greater than three which are not local maxima for d. 

The main claim in 1 7 vl-v2] is as follows: 

Claim 1. If n > 4 then the polynomial p is extensible with respect to its zero a 
unless p vanishes only on the unit circle. 

If true, Claim [T] would imply that the polynomials in Conjecture [2] are not only 
all the extremal polynomials for Sendov's conjecture but also that they are in fact 
all the local maxima for the function d. The following theorem shows that Claim [T] 
is actually false. 

Theorem 1.2. Let 9 £ R, n > 3, and p(z) = z n + e l9 z. Then \p\o = d{p) and p is 
inextensible with respect to 0. 

Proof. It is enough to prove the statement for the polynomial p(z) = z n — z since 
z n + e ie z = e~ ma p{e la z), where a = ^5? . Recall the notations of (|1.1|) and set 

2^0-2) flY^ 1 2xi(fc-l) 

(a =) z\ — 0, Zj — e , 2 < j < n, Wk = I —J e , 1 < k < n — 1. 

Note that |p|o = d(p) and r = n — 1, so that the matrix B(p) defined in (|1.5|) is 
actually a (n — 1) x n matrix. Proposition 11.11 and elementary computations yield 

1 \ n + 1 i / \ W i -i ■ ^ ■ 

O'lyWi) = and aAwi) = — — , l < 1 < n — l, 2 < j < n. 

nwi n(Wi — Zj) z 



Using the fact that for z £ C \ Z(p') one has the well-known identities 

and 

-p"( Z y 



P ( z ) = \p 1 

p'{z) ~[ Z ~ W l 

p^\z)p'(z) - (p"(z)) 2 _ d_ 
(p'(z)Y ~ ~dz 

one can show that if 2 < j < n then 



p'(z) 



E 



_ X Z - Wi) 2 



n—1 n—1 1 n—1 1 

^-^ f Of |. _ r,V 1/1- — J t-^t 



2 



P"(*j) 



(p"(z 3 )Y^p^(z 3 )p'(z J ) 

(p'( Zj )Y 



n 

Zi 
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Since /3y = a,j{wi) + bj(wi) = a,j(wi) — ZjO,j(wi) it follows that 

-— 1 n + 1 " 1 1 " 1 1 

; Ai = — Y\ — = and V fa = z 3 = 0, 2<j<n, 



i=l i=l i=l J 

which shows that B(p) is positively singular. By Theorem 11.11 the polynomial 
— z n — z cannot be extensible with respect to its zero z\ = 0. □ 

In order to explain why the proof of Claim[T]given in [17, vl-v2] fails let us recall 
(jl.ip and the notations of Proposition 11.11 and define the following r x n matrix: 

A(p) = (oiij), where ay = aj(wi), 1 < i < r, 1 < j < n. 

In [T"?] v2, Lemma 4] it is shown that if the polynomial p does not vanish on the 
unit circle then B(p) is positively singular if and only if A(p) is positively singular. 
We reformulate this result as follows: 

Lemma 1.2. Assume that the polynomial p is as in and that it does not 

vanish on the unit circle. Then p is inextensible with respect to its zero a if and 
only if A(p) is positively singular. 

The proof of Claim[T]given in section 6 of [TTl v2] is based on the following claim, 
which is a much stronger version of Lemma 11.21 

Claim 2. If n > 4 and the polynomial p is inextensible with respect to its zero a 
then A(p) is positively singular. 

Claim [5] is a crucial step in section 6 of [TTl v2] since all the arguments used 
in loc. cit. rely heavily on various properties of A(p). As we shall now explain, 
Claim [5] is false. Indeed, the polynomials in Theorem 11.21 show that Lemma 11.21 
cannot hold without the assumption that p does not vanish on the unit circle: 

Proposition 1.2. Let 9 € R, n > 3, and p(z) = z n + e z. Then the polynomial p 
is inextensible with respect to and A(p) is not positively singular. 



The first part of the statement in Proposition 11.21 was proved in Theorem 11.21 
As we shall see below, the second part of this statement is a consequence of the 
following more general result: 



Theorem 1.3. Let the polynomial p be as in (jl.ip . Assume that p satisfies (|1.3p 
and define the following (n — 1) X (n — 1) matrix: 

c (p) = (7ij)i where 7y = (lo* - Zj)~ 2 , 1 < i < n - 1, 2 < j < n. 

Then det(C(») ^ 0. 

For the proof of Theorem l 1 . 31 we need [H Lemma 2.1], which we restate as follows: 

Lemma 1.3. // the polynomial p is as in (jl.ip and has only simple zeros then 
there exist neighborhoods U, V C C of the points u = (a, Wi, . . . , w n —i) and 
(a, Z2, ■ ■ ■ , z n ), respectively, such that 

U 3 . . . ,uj n -i) i-> (Ci,C2, • e V 

is an analytic function, where Cz> • • • j C« are the (simple) zeros different from a of 
the polynomial n f a lX£=i ( w ~~ Uj)dw and £i = Ci( a > u i> ■ ■ ■ 7 S-i) = oc. 

Remark 1.3. We note for later purposes that if the polynomial p is as in (jl.ip 
and satisfies ()1.3p then it follows from Lemma fT~3l and the inverse function theorem 
that the functions u>±,... ,w„_i are locally analytic in Ci? Cz> • • • > Cn and one has 

p{wj) 



dujj 



(Wj - z l ) 2 p"{w j ) 



1 < i < n, 1 < j 1 < n — 1, 
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where v — (zi, 22, ■ ■ . , z n ). Using these identities one can easily compute the coef- 
ficients cii(wj) in Proposition ll.il 

Proof of Theorem \1.3[ Note first that since p satisfies (|1.3|) we may choose the neigh- 
borhoods U and V in Lemma IT751 such that if (a, ui\, . . . , ui n -i) € U then the points 
u>i, . . . ,u) n -i are distinct and the map U 3 (a,u>i, . . . ,oj n _i) i— > (Ci,C2, • ■ ■ >Cn) S V 
is onto. Since Wj, 1 < i < n — 1, are zeros of the logarithmic derivative of the 
polynomial n"=i( z _ 0) = n /a E["=i ( w ~ Uj)dw we get 

= 0, l<i<n-l. 



3=1 

By partial differentiation we obtain 



; =1 ' - Cj 



90 




duii 


U 


(wi -Zj) 2 






duj k 


U 



1 _ p"(u>i) 



3=2 «• ' 3 



(1.7) 



n 



= 0, l<£^fc<n-l, 



where u = (a, Wi,W2, ■ ■ ■ , io n _i). Moreover, by [H Lemma 2.3] one has 
d(i 1 f Zj p'M , 



P'( z 3") A w ~ u 'k 

6 jk = - ZTT^T^TT. \ I -^^~ dw > 2<i<n,l< k<n-l, 



duj k 

Thus, if we set 

p(wk) f Zj p'(w) 

p'(Zj)p"(w k ) J a W-Wk 

and define the (n — 1) x (n — 1) matrix D(p) = (djk) then (|1.7j) may be rewritten as 
C(p)D(p) = I n — i) where I n _i is the (n— l)x(n— 1) identity matrix. This implies 
that det(C(p)) ^ 0, which proves the theorem. □ 

Proof of Proposition \1. 6 A As in the proof of Theorem II. 2\ we may assume without 
loss of generality that p(z) = z n — z. Note that for this polynomial the entries 
ctij = aj(wi) of the (n — 1) x n matrix A(p) were already computed in the proof 
of Theorem 11.21 Let A'(p) denote the (n — 1) X (n — 1) matrix which is obtained 
from A(p) by deleting the first column. If A(p) were positively singular then the 
same would have to be true for A'(p). But this is impossible since p satisfies (ll.3[) 
and by Theorem 1 1.31 one has 

det(A'(p)) = f-i) det(C(p)) ^ 0. 



Therefore A(p) cannot be positively singular. □ 

1.2. Multiple critical points on the critical circle. Having explained some 
of the weaknesses of the arguments in |17| vl-v2] in the generic case, let us now 
examine how these arguments are affected if assumption (|1.3p is removed. We use 
the same notations as in (|l.l[) ~ p.2j) . For the sake of simplicity, we assume that 

p has only simple zeros, r > 2, and w± — w% = . . . = w r . (1-8) 

Thus, the only zero of p' that lies on the a-critical circle of p is w\ with multiplicity 
r > 2. Let (hi,. . . ,h n ) £ D n and set 

c= : , d = d(h 1 ,...,h n ) = -p(wi) (1.9) 

r! f-j' (wi - Zi) 2 
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Note that c ^ and that there exists a closed thin set C D n such that d ^ if 
(hi, . . . , h n ) E D n \ O. Therefore, the quantities 



Lk — Lk(hi, 



exp 



t arg 



p{wi) 



1 



■ arg[ - 



2?rfc 
r 



(1.10) 



are well defined for 1 < k < r and (hi, . . . , h n ) £ D n \ fi. In sections 3-5 of [TTl 
vl-v2] it is shown that if (hi, . . . , /i„) <E 5" \ f2 then there exist e > and distinct 
curves wij(t), 1 < j < r, t € ]0, e], such that 

dq(z,t, hi,..., h n ) 



dz 



for t G ]0, e] and lim wij(t) = Wi, 1 < j < r. 



lUlj(t) 



As in Proposition ll.il and (|1.5[) . the coefficients of the linear terms in the first-order 
Taylor expansions of \wij(t) — Zi(t)\, 1 < j < r, are viewed as the entries of an 
r x n matrix B(p) = (J3%j), 1 < z < r, 1 < j < n. However, these entries will now 
depend on the parameters hi, . . . , h n . Indeed, the computations in [17, vl-v2] (cf., 
e. g., formula (11) in [TTl vl-v2]) show that 



Pij = a j( w i) + bj(wi), 1 < i < r, 1 < j < n, where 

a j (w l ) = -- —, 2<j<n, , . 

(w{ - a)(wi - ZjY U- 11 ) 



ai(wi) = — 



(Wi - a) 2 



bj(wi) = -Zjaj(wi). 



Remark 1.4. The error that initially appeared in formula (11) of 17, vl-v2] was 
later corrected in formula (20) of [ITl v3-v4] and formula (15) of [TTl y 5]- 

From (|1.9| - (|1.11|) one can see that in this case system (jl.6|) is not linear, so that 
Theorem 1 1.1 1 can no longer be used. This invalidates the proof given in [I7J vl-v2, 
section 6] for non-generic cases since the arguments which are used in loc. cit. are 
based on the assumption that system (|1.6[) is linear and they rely heavily on Thco- 
rcm ll.ll For the same reasons, the word "linear" should be removed from Definition 
2 in jTTj vl-v2, section 5] if one would still like to have a notion of extensible poly- 
nomial which is at least properly defined in non-generic cases. 

1.3. First order approximations of the critical points: further drawbacks. 

The arguments used in [171 vl-v2] were subsequently modified or replaced by com- 
pletely new ones in [171 v3-v8]. Keeping track of so many changes and versions is 
both time-consuming and technically challenging. Unfortunately, this development 
has been rather confusing and has led some to believe that Schmieder's proof is 
correct or that his method would eventually work after some minor adjustments. 
This is most definitely not the case, as we shall now explain. 

First of all, as we already saw in the previous sections, the notion of inextensible 
polynomial which was implicitly defined in [17[ vl-v2] is much weaker than the 
notion of locally maximal polynomial for Sendov's conjecture (see Definition II. ip . 
In [TTl v3-v7, Definition 2] Schmieder tries to make these two notions synonymous 
by using a new definition of local maximality. However, this new definition is still 
quite different from the actual definition of local maximality which was given in 
Definition 11.11 As a matter of fact, it is ambiguously formulated and inaccurate 
on several points. For instance, the words "decreasing" and "increasing" in this 
definition should be interchanged. Moreover, as it stands in [TTl v3-v7], Definition 2 
is valid only in the generic case when the polynomial p has no multiple critical points 
on the critical circle. Finally, this definition is inadequate in that it does not allow 
arbitrary variations of the zeros of p. Indeed, it only allows variations of the type 
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described in (jl.2p . These variations are too restrictive since boundary zeros are 
always sent to boundary zeros (as is well known, locally maximal polynomials must 
have at least two zeros on the unit circle (cf., e. g., [12])). 
Let us now examine the arguments used in |17[ v3-v8] . 

Versions 3-5 of |17j . One of the main claims in [171 v3-v5] is Theorem 1, which 
asserts that if a polynomial p has a multiple critical point £ on the critical circle 
and if p(£) ^= then p cannot be locally maximal unless all its zeros lie on the unit 
circle. However, the proof of Theorem 1 given in |17[ v3-v5] is not valid. Indeed, 
a crucial part of this proof is contained in the paragraph starting with "For such 
h we investigate the sign of the expressions Fu(t) ..." that immediately precedes 
Theorem 1. This paragraph contains several erroneous arguments: for instance, 
the conditions "Fu(t) < for all — e < t < e and all such Z" are clearly wrong and 
should be replaced by "mini<i< CTfc Fu(t) < for all — e < t < e". This invalidates 
the concluding argument which basically says that if p is locally maximal then 
= Oro(Cu) + <„(Cii) = a m ((u) - z^a m (Ci/) for 1 < I < a k and so \z m \ = 1. 

Another major objection to the arguments used in [T7J v3-v5] is that these are 
based on a criterion for local maximality ( |17[ v3-v4, Lemma 2]) which is incorrect. 
Indeed, this criterion essentially claims that a polynomial p with simple critical 
points on the critical circle is locally maximal if and only if system (|1.6j) is not 
solvable or, equivalently, if and only if the matrix B(p) defined in (|1.5|) is positively 
singular (cf . Theorem II. ip . This amounts to saying that p is locally maximal if 
and only if it is inextensible in the sense of Definition 11.21 The example given in 
Lemma fTTTI shows that this is definitely wrong. As one can see from Proposition ll.il 
if t > is sufficiently small and the polynomial q is as in (|1.2|) then the best one can 
say is that if p is inextensible then d(q) < d(p) +0(t 2 ). If anything, this shows that 
if p is inextensible then first order approximations of the critical points of p - like 
those used in all versions of [T7J - are not enough for deciding whether p is locally 
maximal or not. To do this one has to use higher order approximations of the 
critical points (see section 2.2 below). Thus, the correct version of Lemma 2 in [P7|, 
v3-v4] should be stated as the following necessary criterion for local maximality: if 
a polynomial p with simple critical points on the critical circle is locally maximal 
then its associated matrix B(p) is positively singular. As mentioned in section 1.1, 
this criterion has already been used in [5] . Note also that a somewhat more flexible 
necessary criterion for local maximality was obtained in [T] (the latter criterion 
imposes no conditions on the critical points and assumes only that p has simple 
zeros on the unit circle). 

Versions 6-8 of |17j . The main claim of [171 v6-v8] is Theorem 1, which may be 
formulated as follows: 

Claim 3. Let p E S„ and Z\ £ D be such that p{z\) = 0. Then there exist 
a complex number wq with \wo\ — 1 and p* € S n such that p*(wo) — and 
min Cez( P *') K - CI > min weZ(p /) \z x - ui\. 

The proof of Claim [3] given in [171 v6-v7] is quite different from the one given 
in [T71 v8]. In [TTJ v6-v7] Schmieder considers an arbitrary polynomial p(z) — 
(z — zi)r(z) e S n such that d(p) = \p\ Zl = min we ^( p /) \z\ — u>\ and he defines the 
following perturbations of p(z): 

Q(z) — Q(z; t, h) = (z — zi(t, h)) r(z), where 

t e [0,1], hedD and Zi(t,h) = Zl + - H . 

1 + thz\ 
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A careful examination shows that if true, the local arguments used in section 3 of 
[T7| v6] and section 2 of [T71 v7] would actually imply that the following claim must 
be valid as well. 

Claim 4. There exists h £ dD such that for all sufficiently small t > one has 
min Cez(Q') h ) - CI > min^g^p/) \zi - u\, that is, d(Q) > d(p). 

We shall now construct counterexamples to Claim 2] for each degree n > 4. To 
do this, recall the notations used in the proof of Theorem 11.21 and let 



p(z) = z n — z = Y\^( z ~ z i) ana - P'( z ) — 71 W_( z ~ w k), where z\ = 0, 

j=i k=i ( 112 ^ 



2iri(j-2) 

e "- 1 , 2 < j < n, and to/- 



1 < fc < n - 1. 



Let further k be a fixed positive number and (et,...,e n ) € C n be such that \Ej \ < 
|ei| 1+K for 2 < j < n and set 



Q(z) = Q(z;ei, . . .,e n ) = - (zj+Ej)). 

Using the notations in (|1.12|) - ([1.13|) we can show the following result: 
Proposition 1.3. If n > A then for all sufficiently small \ei\ one has 

IT 



(1.13) 



min \zi + £i — CI < min \ z i — u>\ — 



cos 



n- 1 



ei • 



Thus, if Ej — /or 2 < j < n or, more generally, if \zj +£j\ < 1 /or 2 < j < n then 
for all sufficiently small > one has Q £ S n and d(Q) < d{p). 

i 

Proof. Note that \uik\ — = \p\z ± = d(p), 1 < k < n — 1, and denote the zeros 

of Q'(z) by ojk — <^fc(£i, ■ ■ ■ , s n ), 1 < fc < n — 1. We assume that these are labeled 
so that ujk(0, ■ ■ ■ , 0) = Wk, 1 < k < ri — 1. It follows from (|1.12p that if ej 7^ then 
there exists j E {1, . . . , n — 1} such that 



I argei - argw^l < 



n-T 



so that ( — I > 



l£i| 
d(p) 



cos 



n — 1 



(1.14) 



Now using Remark 11.31 with v = (0, 22, . . . , z n ) and the computations in the proof 
of Theorem 11.21 together with the assumption that \ei\ < \ei\ 1+K for 2 < i < n we 
get 



\u>j(e-L, ...,e„) -ei| = 



n ^ 

EOUlj 



n+1 



1? 



£ , + 0(M 2 ) 



+ 0(N 2 ). 



(1.15) 



From (|1.14p - (|1.15p and the inequality n + 1 > nd{p) it follows that for n > 4 and 
small lei I one has 



|Wj(£i, . . . ,e„) - £i| < - 

< d(p) ~ 
which proves the proposition. 



n+1 j it 

cos 



nd{p) 



cos 



n- 1 



n- 1 

£1 



N + 0(N 2 ) 



□ 
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Clearly, Proposition 11.31 contradicts Claim H] and so it invalidates the results of 
PH v6-v7]. 

In [TTl v8] Schmieder uses a modified variational method and replaces the local 
arguments of |17[ v6-v7] with some global arguments. Given a polynomial p(z) — 
{z — zi)q(z) £ S n he considers the one-parameter family of polynomials 

Q(z; u) = [z — u)q(z), where u £ D, (1-16) 

and studies the Riemann surface 1Z that consists of the critical points of Q when 
u varies in D (that is, the zeros of the equation -^Q(z;u) — for u £ D). Note 
that the compact manifold TZ contains at most 2(n — 1) branch points. A close 
examination of sections 2 and 3 of (TTJ, v8] shows that if valid, the arguments used 
in the proof of Claim [3] given in loc. cit. would actually imply that the following 
claim - which is a much stronger statement than Claim [3]- is also true. 

Claim 5. Let p(z) = (z — zi)q(z) and p*{z) — (z — wo)q(z), where q £ S n —i 
and z\ and wq are complex numbers such that \zi\ < 1 and \wq\ = 1. Then 
min Cez( P *') K - CI > min we 2(p') \zi - u>\. 

The following proposition shows that Claim [5] fails in a very strong way. 

Proposition 1.4. If n > 5 then there exist z± £ D and q £ S n -i such that for any 
complex number wq with \w$\ — 1 one has min^^,') |wo — CI < m i n ^ez(p') \ z i~ w\, 
where p{z) = [z — zi)q(z) and p*{z) = (z — wo)q(z). 

Proof. Let wq £ C be such that \wq\ — 1 and set z\ — and q(z) = z™ -1 — 1, so 

that p(z) = z n — z and p*(z) = [z — wo)(z n ^ 1 — 1). For any ui £ Z(p') one has 
i 

\z! -w\ = (i) , so that 

min \zx-w\ = (-) " 1 . (1.17) 



Since |t0o| = 1 and Z(q) = {e : 0<fc<n — 2} one gets from either the Schur- 
Szego composition theorem ([T3J Theorem 3.4. Id]) or the Grace- Heawood theorem 
([T3J Theorem 4.3.1]) that 

. , min aeZ( ) \w -a\ sin (n/2(n - 1)) 
mm \wo — C < ^ . , . r < —. — . (1.18) 

Cez(p*') 2sm{TT/n) sm(n/n) 

From (|1.17p and (|1.18p we see that in order to prove the proposition it is enough 
to show that 

sin(7r/2(n-l)) /l\^ 

, —. —. . — — < - for n > 5. (1-19) 

sm(7r/n) \n J 

Numerical checking shows that (|1.19p is true for n = 5, 6, 7 or 8, and so we may 

7T 7T7T Sinx . 

assume that n > 9. Since < — ; < — < — for n > 5 and x — =■ is an 

2(n - 1) n 4 ~ y/x 

increasing function on I 0, — I we get that 



4, 

sin(7r/2(n - 1)) 

sin(7r/n) " V 2(n — 1) 



< J— for n > 5. (1.20) 



The sequence \ 2(l — — ) — n™- 1 is clearly increasing for n > 2, so that 




I)-n^>--3V4 >0 
rt / 3 



whenever n > 9. This implies that the right-hand side of (|1.20p is always less than 
(i) ~ if n > 9, which proves p~T^)l . □ 
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Let us finally note that the condition |ioo| = 1 is never really used in sections 2 
and 3 of [TTJ v8], which is quite strange. Indeed, the "blowing up and pulling back" 
technique used in loc. cit. is in fact a kind of projectivization method for which 
no assumption on wq other than \wq\ < 1 seems to be necessary. Proposition 11.41 
shows quite clearly that such arguments cannot be valid. 

Remark 1.5. We have pointed out only some of the most serious (actually, ir- 
reparable) errors in p~7l vl-v8]. However, several other technical or conceptual 
mistakes appear in loc. cit. For instance, Lemma 3 in [171 v3] is contradicted by 
the polynomials Pi(z) = [z — 1)" and p%{z) — {z + 1)" or indeed any sufficiently 
small perturbations of these polynomials. Also, the identity "log(p(Q(-, t))) = 
log|Cfc(i) — - which is consistently used in [T71 v6-v8] - is definitely wrong 

and should be replaced by "log(p(Q(-, t))) = min^ log \Ck{t) — Zi(t)\" . 

Conclusion. The failure of the method proposed in |17j is mainly due to the fact 
that the local arguments used in loc. cit. are based on first order approximations 
of the critical points of a polynomial. As we already explained, such arguments 
lead only to necessary conditions that do not usually provide enough information 
for deciding whether a given polynomial is locally maximal for Sendov's conjecture 
or not. This is hardly surprising since it is actually a common feature of most 
non-trivial extremal problems, which often require higher order approximations. 
A real- valued C 1 -function / of one real variable such that /' vanishes at a point 
where / does not have a local maximum is arguably the simplest example that 
comes to mind in this context. In section 2.2 we shall construct explicit examples 
of polynomials for which second order approximations of their critical points are in 
fact the only way to prove that they are not locally maximal for Sendov's conjecture. 
It is therefore highly unlikely that the approach proposed in [17] could be made 
into a successful method for dealing with Sendov's conjecture in its full generality. 

1.4. Smale's mean value conjecture. In his 1981 work on the complexity of al- 
gorithms for successive root approximation Smale conjectured that any polynomial 
p E C[z) of degree n > 2 with p(0) = and p'(0) ^ satisfies the following mean 
value property (cf. "2*3} Problem IE]; see also [2*4]): 



Reviews of the results known so far on Smale's mean value conjecture and related 
questions have appeared in [I"">j , [T^ and [*"T] . A proof of this conjecture was recently 
claimed in 18 J. The variational method used in loc. cit. is the same as the one that 
was used in [17] for Sendov's conjecture. A notion of extensible polynomial similar 
to the one defined in [17] is introduced and it is claimed that p is extensible unless 
all the roots of p other than have the same absolute value. Unfortunately, the 
proof given in [18] relies upon the same erroneous or insufficient arguments as those 
described above in the case of Sendov's conjecture. Indeed, a close examination 
shows that except for some minor changes, the arguments used in [THl vl-v3], [HI 
v4-v5], [HI v6-v7] and [HI v8] are pretty much the same as those used in \F7\ vl- 
v2], [TO v3-v5], [T71 v6-v7] and [TO v8], respectively. Thus, the proofs of Smale's 
mean value conjecture claimed in [151 vl-v8] are not valid. For the same reasons, 
one may safely say that there is hardly any chance that the approach proposed 
in loc. cit. could be made into a successful method for dealing with Smale's mean 
value conjecture in its full generality. 

On a different note, we would like to mention an interesting recent preprint of 
Tyson f|26p where counterexamples to Tischler's strong form of Smale's mean value 
conjecture (cf. [25]) were constructed for each degree n > 5. 
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2. 0-MAXIMAL POLYNOMIALS, SECOND ORDER VARIATIONAL METHODS, AND 
LOCAL MAXIMA FOR SENDOV'S CONJECTURE 

The polynomials z n — a n with \a\ = 1 are the only concrete examples of a- 
maximal polynomials known so far. As pointed out in the introduction, these are 
also the only known examples of local maxima for Sendov's conjecture. In this 
section we first give a complete description of 0-maximal polynomials of arbitrary 
degree. We then use a second order variational method to show that - despite all 
appearances - these polynomials are not necessarily locally maximal for Sendov's 
conjecture. We conjecture that polynomials with the latter property must in fact 
be a-maximal with |a| = 1 (Conjecture 2] below). 

2.1. Classification of O-maximal polynomials. In this section we determine all 
0-maximal polynomials and study some of their properties. We start with a few 
preliminary results, the first of which is well known from the theory of self-inversive 
polynomials: 

Lemma 2.1. Let p(z) = ^2^ =0 cikZ k be a complex polynomial of degree n > 1. // 
all the zeros of p lie on the unit circle then a^ao = a„a n _fc, < k < n — 1. 

From Lemma 2.1 we deduce the following result. 

Lemma 2.2. Let q be a complex polynomial of degree n > 2 and a G Z(q). If 
R > is such that \w — a\ — R for any w € Z(q') then 



n R 2 (">-V q >( a + z ) = (Z '( a )z"- 1 (? '(a + i? 2 ^ 1 ) 
for all z G C. that is, 



{n-k-l)\nR 2k q {k+1) {a) = k\ q' {a) q( n ~ k ) (a), 0<k<n-l. (2.1) 
Proof. By assumption, the polynomial 



q'(a + Rz) = 



k=0 



is of degree at least one and has all its zeros on the unit circle. The identities in 
(12. ip follow by applying Lemma 2.1 to the polynomial q'(a + Rz). □ 

We shall also need the following lemma. 

Lemma 2.3. If n is an integer greater than two then 

jx G [l,n— 2] : n^ 5 " (n ~ x) - n(x + 1) = j 

Proof. The assertion is trivially true for n = 3 and we may therefore assume that 
n > 4. Note that 

(-l,n) 3 x i ► f(x) = \ - l)log?i + log( ^—^ 

\n — 1 / \x + 1 

is a continuously differentiable function which satisfies f(n — 1 — x) + f(x) — 0, so 
that / f 2 ^ 1 ) =0. It is then easily seen that the lemma is in fact equivalent to the 
following statement: 

f{x) ^0 for any xE [1, n - 2] \ | . (2.2) 

In order to prove (|2.2[) we set 

n + 1 / n + 1 



71 — 1 \Tl — 1 log 7! 
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and notice that 

< A n < (^zj) if n - 2 - 

It follows that -1 < < < x+ < n, where x± = n ^-(l±y/A^). An 

elementary computation yields 

21o g n (,-,+)(.-, ) x 
n—1 [n — x)[x + 1) 

which shows that the function / is increasing on (x—,x+) and it is decreasing on 
both (— and (x+,n). It is not difficult to prove that 

/ n _3\ 2 

n G {4, 5, 6} =>■ 41ogn > n + 1 A„ > =*> > n - 2 

and also that 

n > 7 =>• 41ogn < n + 1 =*> A„ < [ - — -) =>■ x + < n - 2, 

which together with the identity x+ + X- = n — 1 show that 

2] C (x-,x+) if n G {4,5,6} and [x_,a; + ] C (l,n- 2) if n > 7. (2.4) 
Moreover, the inequality 

2 n - 1 (l + —^—) >71 2 , 73 > 4, 



n- 1 



implies that 



j-i — -i IT) 1 \ 

/(n-2) = -/(l) = — logn-logf — J > if n > 4. (2.5) 

From (|2.3p ~ (|2.5p we deduce that (|2.2p must be true, which proves the lemma. □ 
We are now ready to prove the main result of this section: 

Theorem 2.1. The 0-maximal polynomials of degree n > 2 are given by: 

(i) z 2m + e lS z if n — 2m and m > 1, where 9 G R. 

(ii) z 2m+1 + Xe t9 z m+1 + e l2e z if n = 2m + 1 and m > 1, where A, G R trnd 

I a, I / 2V2m+l 
|A| < m + 1 • 

Proof. Let p be a 0-maximal polynomial. Since p G 0), we may write 

n—1 n—1 

p(z) = z" + afcZ fc = z J^J (z — Zi), where \zi\ < 1, 1 < i < n — 1, 
fe=l i=i 

n—1 n—1 
]/(z) = 77Z™ _1 + fcafeZ fe_1 = 77 J^J (z — Iffj). 

fc=l J=l 

By comparing p with the polynomial q{z) := z™ — z G 5(n, 0) we get 

i 

1\ ~ 



min |u)j| = |p| > \q\ = 

\<3<n— 1 \77 

which combined with the identity n2=i ^ = ( — 1)"~V(0) = n Ilj=i w i yields 
n™^ 1 — n Ilj 1 =i L |wj | > 1. Since \zi\ < 1 for 1 < i < n — 1 we deduce that 

N = 1, 1 < « < n- 1, and |io 3 -| = ( - J , 1 < j < n - 1. (2.6) 
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It follows in particular that |ai| = | ^"Tj 1 Zj| = 1, hence 

ax = e ae (2.7) 

for some 6 E R. From (|2.6p and Lemma 2.1 applied to the polynomial z p(z) we 
get 

a„_ fe = e~ l20 a k+ i, 1 < k < n - 2, 

while (|2.6|) and Lemma 2.2 applied to the polynomial p with a = and i? = (^) 1,-1 
imply that 

^-^^^f-^e-^Ok+i, l<A;<n-2. 
n — k \nj 

Thus 

(n - k) - n(k + 1) a k+1 = 0, 1 < fc < n - 2, (2.8) 

and then by Lemma 2.3 we get that eifc = for 2 < k < n — 1 if n is even. 

Let now n = 2m + l, m > 1. In this case Lemma 2.3 and (I2.8[) imply that = 
for 2 < fc < n — 1, k ^ m + 1, so that 

p(z) = z 2m+1 + a m+1 z m+1 + e l2e z, (2.9) 

where is as in f|2.7p . From (|2.6p and (|2.9p we obtain 



2m 2m 
i=l i=l 



Note that by Newton's identities one has actually that 

2m Tit— 1 / 2m \ 

y^zf = -ma m+ i + a„ i+fc+ i f ^ zj'J = -ma m+ i. 

i=i fe=i S=i 

We deduce from the last two formulas that a m +i — e l28 a m+ i, which together with 
(|2.9|) implies that the polynomial p must be of the form 

p{z) = z 2m+l + Xe i9 z m+1 + e l29 z, where A e R. 

An elementary computation shows that if 0, s, t € R with £ > then the roots of 
the equation x 2 + se l8 x + te t2e — cannot have the same absolute value unless 
s 2 < 4t. By (|2.6|) all the zeros of the polynomial 

p\z) = (2m + l)z 2m + (m + l)Ae l V" + e i2e 

have the same absolute value. Using the substitution x — z m and the above- 
mentioned result on second degree equations we see that this cannot happen unless 

I x I s 2V2to+1 
l A l - m+l ■ 

l 

To summarize, we have shown that if p G S(n,0) then \p\o < unless p 

is of the form (i) or (ii) according to the parity of n. On the other hand, it is 

i 

easily checked that a polynomial of the form (i) or (ii) satisfies |p|o = (^) , which 
completes the proof of the theorem. □ 

Remark 2.1. Let p be a 0-maximal polynomial of degree n > 2. By Theorem 12. II 

all the critical points of p lie on the 0-critical circle \z\ = (i) and all the zeros 
of p except lie on the unit circle. Thus, all 0-maximal polynomials satisfy Miller's 
conjecture (Conjecture [3]). Note also that if p is 0-maximal then it has only simple 
zeros. The same is true for p' except when n is odd and p is of the form (ii) with 
|A| = in which case p' has distinct zeros each of multiplicity two. 

Corollary 2.1. If p is a 0-maximal polynomial of degree n > 2 then d(p) = \p\o- 
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Proof. Let z £ Z(j>) \ {0}, so that \z\ — 1 by Remark 1 2. II Denote by H z the closed 

half-plane which contains z and is bounded by the line passing through § which is 

orthogonal to the segment [0, z]. Since p(z) = = p(Q) it follows from a well-known 

consequence of the Grace-Heawood theorem that there exists w £ Z(p') D H z (see, 

e. g., the supplement to Theorem 4.3.1 in [13]). By Remark |2 . II again one has that 
i 

\ w \ — \p\o = (^) " _1 > I- It is then geometrically clear that \z — w\ < \p\o, so that 
min weZ (p') \z-w\< \p\ . □ 

Remark 2.2. Theorem 12.11 and Corollary 12. II do not automatically imply that 0- 
maximal polynomials are global maxima for the restriction of the function d to the 

subset S(n, 0) of S n . For this one would have to show that d{p) < (^) " _1 whenever 
p £ S(n,0), which we actually conjecture to be true. Note that the best estimate 
known so far for p £ S(n, 0) is d(p) < 1 (see [131 Theorem 7.3.6]). 

Remark 2.3. It is not difficult to see that that max pe s( n l ) \p\i = 1 (cf. [15]). On 
the other hand, Theorem 12. II shows that uiaXp e s(n,o) \p\o increases to 1 as n — ► oo. 
It would be interesting to know whether there exist a £ D and c £ (0, 1) such that 
max p6S( „ Q) \p\ a < c for any n > 1. 

To end this section let us point out that Theorem 12.11 solves in fact the following 
more general extremal problem concerning the distribution of zeros and critical 
points of complex polynomials: 

Problem 1. Let n > 2, a G C, and R > 0. Find the largest constant p :— p(a, n, R) 
with the property that for any complex polynomial p of degree n satisfying p(a) = 
and min^g^^/) \w — a\ > R one has msx zG z(j>) \ z ~ a \ ^ P- 

It is easy to see that p is invariant under translations in the complex plane so 
that it does not depend on a. Moreover, the Gauss-Lucas theorem clearly implies 
that p > R. On the other hand, by considering p(z) = (z — a) n + nR n ^ 1 (z — a) 
we see that p < Rn 1 ^ 1 . Essentially the same computations as in the proof of 
Theorem 12.11 show that one has in fact p = Rn » I=T : 

Theorem 2.2. With the notations of Problem 1 one has p = Rn™- 1 and this value 
is attained only for the following polynomials: 

(i) (z - a) 2m + 2mR 2m ~ 1 e ie (z - a) if n = 2m and m > I, where 9 € R. 

(ii) (z - a) 2m+1 + \^2m + lR m e l6 {z - a) m+1 + (2m + l)R 2m e i26 (z - a) if 
n = 2m + 1 andm> I, where A, 9 £ R and |A| < 2 ^"+ 1 . □ 

2.2. Second order approximations of the critical points and local maxima 
for Sendov's conjecture. As shown in Lemma [L~T] the polynomial z 3 + z and its 
rotations are inextensible with respect to but they are not local maxima for the 
function d. On the other hand, all the properties of the polynomial p(z) = z n + z 
that we discussed so far seem to suggest that if n > 4 then p and its rotations could 
in fact be locally maximal for Sendov's conjecture. Indeed, Theorem 12. II combined 
with the fact that \p\^ < \p\o = d(p) for Q £ Z(p) \ {0} implies that the polynomial 
p is locally maximal for the restriction of the function d to S(n, 0). Moreover, p 
is locally maximal for variations such as those described in Proposition 11.31 In 
addition to that, the zeros of p and its critical points are symmetrically distributed 
and satisfy Conjecture [3] Finally, by Theorem 11.21 the polynomial p is inextensible 
with respect to 0. We shall now use a second order variational method to show 
that - contrary to what one might expect from the aforementioned properties - 
the polynomial p is not locally maximal for Sendov's conjecture. (The discussion 
in section 1 clearly shows that first order variational methods are not enough for 
deciding whether p is a local maximum for d or not.) Theorems 12.31 and 12.41 below 
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show that if n = 4 or 5 then p is a kind of inflection point for the function d. We 
conjecture that the same is actually true for all degrees (Remark l2.4|) and also that 
polynomials of the form z n + e l9 , 9 £ R, are in fact all the local maxima for the 
function d (Conjectured]). These results complement those obtained in section 1 
and show quite clearly that the methods used in [17j cannot provide successful ways 
of dealing with Sendov's conjecture in its full generality. 
Let a £ [0, 1] and set 



^0= VI' 



CO) = exp 



a\a + a 2 a 



3V3r 
2-3r' 

,21 " 



Ct 2 



V3 [(3r + 2) 2 + 4] 
2(3r-2) 2 



(2.10) 



We use the quantities in (|2.10|l in order to construct certain continuous deformations 
of the polynomial z + z. For sufficiently small a > these perturbations give rise to 
a one-parameter family of polynomials in S4 with the following interesting property: 



Theorem 2.3. Let £(a) be as in (|2.10[) and define the polynomial 

£ S 4 . 



p a (z) := (z - a)(z + 1) [z - ((a)} z - ((a) 



Then for all sufficiently small a > one has A(p a , z A + z) = 0(a) and 



(2.11) 



d{Pa) = \Pa\a = r + Ca 2 + C(a 3 ), where C 



4r(2- 3r) 



10.81154938. 



In particular, the polynomial z 4 + z is not locally maximal for Sendov's conjecture. 

Proof. Let us first describe the steps we took in order to arrive at the quantities 
in (|2.10p and the one-parameter family of polynomials defined in (12. lip . We start 
by introducing six real parameters which we denote by Xi, yi, 1 < i < 3, and we 
define the auxiliary polynomials 



P a (z:xi,x 2 ,x 3 ) = (z - a) [z - zi(a;x 3 )} [z - z 2 {a;xi 1 x 2 )} z - z 2 (a;xi,x 2 ) 

3 

= z 4 +2_ J bj{a]Xi,X2,x s )z j , 
3=0 



}a(z;yi,y 2 ,y 3 ) = 4[z- wi(a;y 3 )] [z - u 2 (a;y 1 ,y 2 ,ya)} z - w 2 (a; yi, 2/2,2/3) 

2 

= 4z 3 + J2ck(a; yi ,y 2 ,y 3 )z k , 



k=0 



where 



r / 7r 

z 1 (a;x 3 ) = -1 + x 3 a 2 , z 2 (a; Xi, x 2 ) = exp i^— 



+ X\a + x 2 a' 



wi(a;y 3 ) =a - (r + y 3 a 2 ) , 

^2(0; 2/1, 2/2, 2/3) = a+ (r + y 3 a 2 )exp 



(2.12) 



2/1 a + y 2 a 



The idea is now to investigate whether the parameters Xi and yi, 1 < i < 3, may 
be chosen so that the following conditions are satisfied: 

(i) £3 > and y 3 > 0; 

(ii) for all sufficiently small a > one has 

max \P' a {z;xi,x 2 ,x 3 ) - Q a {z;yi,y 2 ,y 3 )\ = 0{a 3 ), 
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where P' a (z; x\, x%, X3) denotes the derivative of P a (z; x%, X2, 2:3) with respect to z. 
To do this, we expand the coefficients of P' a (z; Xi, x%, X3) and Q a (z; 2/1, 2/2, 2/3) into 
their MacLaurin series so as to get second order approximations of these coefficients 
(with an error of 0(a 3 )). Let bj(a;xi,x 2 ,x 3 ), 1 < j < 3, and Sfe(a; 2/1,2/2,2/3), 
< k < 2, be the resulting second degree MacLaurin polynomials in the variable 
a for the coefficients bj(a; Xi, X2, X3), 1 < j < 3, and Ck{a; 2/1, 2/2, 2/3), < k < 2, 
respectively. Then we may write 



(m + l)6 ro+ i(a; 2:1,2:2,2:3) - c m (a; 2/1, 2/2, 2/3) 

2 

= ^ d mn (x 1 ,X2,x 3 ,y 1 ,y 2 ,y3)a n , 



< to < 2, (2.13) 



n=0 



where d mn (xi, X2, X3, 2/1, 2/2, 2/3), < m, n < 2, are real polynomials in the variables 
afi, 2:2,2:3, 2/1, 2/2, 2/3- Clearly, any solution (2:1, x 2 , x 3 , 2/1, y 2 , 2/3) € R 6 to the system 
of polynomial equations 



,(afi, X2, %3,yi, 2/2, 2/3) = 0, 0<m,n<2, 



(2.14) 



that satisfies X3 > and 2/3 > will also satisfy conditions (i) and (ii) above. It 
turns out that if we let X3 — then system (|2.14p may be reduced to a system of five 
linear equations in the variables x±, X2, yi, 2/2, 2/3 which admits a unique solution. 
We arrive in this way at the following solution to system (|2.14l) : 

3\/3 



2/2 



ai, x 2 — a>2, X3 = 0, 

3V3(12r 2 + 8r + 3) 
8r 2 (3r-2) 2 ' 



III 



ys = c 



2r(2-3r)' 
3 



(2.15) 



4r(2 - 3r) ' 



where r, ol\ and 012 are as in (|2.10p . Henceforth we assume that the values of the 
parameters 2:1,2:2, ^3,2/1, 2/2, 2/3 are those listed in (|2.15jl . In particular, this implies 
that P a (z; xi,X2, X3) is the same as the polynomial p a (z) given in (|2.1ip . Note that 
by (|2.10[) one has A(p a ,z 4 + z) — O(a) for all small positive a. To simplify the 
notations, the zeros of the polynomial Q a (z; yi, 2/2, 2/3) will be denoted by u>i(a), 
0J2(a) and ^(a), respectively. Then (|2.12j) and (|2 . 1 5[) imply that 

\u) X {a) -a\ = \Lu 2 {a) - a\ = r + Co 2 . (2.16) 

It is now practically clear that the desired conclusion should follow from the 
Newton- Raphson algorithm. We check this in a rigorous way by using the following 
simple observation. 

Lemma 2.4. Let R be a complex polynomial of degree d > 2. If w G C is such 
that i?'(w) 7^ then there exists a zero z of R such that 

R(w) 



w 



z\<d 



R'(w) 



Proof. If R(w) = there is nothing to prove. Otherwise, we let Zi, . 
the zeros of R. Then 



, Zd denote 



E 



1 



> 



E 



1 



R'(w) 



R(w) 



which proves the lemma. 



□ 



Let u>i(a), u>2(a) and 1113(a) denote the critical points of the polynomial p a (z). 



Since p a (z) 



z as a — > 0, we may label these critical points so that w\(a) £ 
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R, 3(u>2(a)) > and u>3(a) = 11)2(0,) if a is positive and sufficiently small. A 
straightforward computation shows that 



|/>i(a)) =' 6 - + 0{a) and p> 2 (a)) = p'i (c 2 (a)) = + O(a), 



which combined with Lemma 12.41 and condition (ii) implies that 

u>i(a) = uii(a) + 0(a 3 ) and W2(a) = w 3 (a) = W2(a) + 0(a 3 ). 
The theorem is now a consequence of (|2.f 6|) . 



□ 



As one may expect, the second order variational method that we used in the proof 
of Theorem 12.31 works even in more general cases. However, for higher degrees the 
procedure described above requires an increasingly large amount of computations. 
Tedious as they may be when done by hand, for small degrees these computations 
become relatively easy if one uses for instance a Maple computer program. Indeed, 
such a program has considerably simplified our task in the course of proving Theo- 
rem [53] and it also allowed us to obtain a similar result for the polynomial z 5 + z. 
In order to formulate this result we need to introduce some additional notations: 
let a € [0, 1] and set 

4\/2 



\z 5 + z\ 



60s 4 - 19 
50s 2 (2s 2 -l) 2 ' 







K 



2V2s 2 
1 - 2s 2 ' 
2 



7 



77(a) = exp 



[3a 



X2(a) = a + (s + Ka 2 ) exp 



5s(l-2s 2 )' 

. , , ., - 5.665658792, 
5s(l — 2s 1 ) 

Xi(a) = a + (s + Ka 2 ) exp 
3tt 

T 



7a - 



da' 



+ -fa — 5a' 



We use these quantities in order to define the following one-parameter families of 
polynomials: 



Qa(z) := (z — a)(z — 77(a)) (z — iri(a)) z — 77(a) z + irj(a) G 5 5 , 



s a (z) := 5(z - Xi( a ))( z - X2{a)) \z - xi(a) \z - X2(«) j 

Our next result is an analogue of Theorem l2.3l for the polynomial z 5 + z. Its proof 
is similar to that of Theorem 12.31 and is therefore omitted. 



Theorem 2.4. For all sufficiently small a > one has 

la S a 



A(q a , z J + z) = 0(a) and &(j,f) = 0(a a ), 

so that d(q a ) = \q a \a — s + Ka 2 + C(a 3 ). In particular, the polynomial z 5 + z is 
not locally maximal for Sendov 's conjecture. □ 



Remark 2.4. In view of Lemma rTTTI and Theorems 12 . 3H2741 it seems reasonable to 
conjecture that polynomials of the form z n + z or, more generally, the 0-maximal 
polynomials of degree n > 2 given in Theorem 12.11 are not local maxima for the 
function d. One could for instance try to find an algorithmic proof of this conjecture 
based on the second order variational method that we just described. However, we 
believe that the results of this section are enough to emphasize both the limitations 
of what can be achieved through first order variational methods and the necessity 
of using higher order approximations when dealing with problems such as Sendov's 
conjecture. In particular, these results reinforce our earlier statement that methods 
such as those used in [T7] and [TS] can only lead to already known partial results 
and are unlikely to be successful in the general case. 



MAXIMAL AND INEXTENSIBLE POLYNOMIALS 



21 



As we already pointed out in the introduction, the fact that the function dor 
fails to be (logarithmically) plurisubharmonic in the polydisk D n accounts for many 
of the difficulties in studying locally maximal polynomials for Sendov's conjecture. 
Nevertheless, determining all such polynomials is a natural and interesting question. 
The polynomials in Conjecture [5] (i. e., the polynomials which are conjectured to 
be extremal for Sendov's conjecture) are the only examples of local maxima for 
the function d known so far (cf. [TU] and [U])- We propose the following stronger 
version of Conjecture [H 

Conjecture 4. Let p 6 S„ . Then p is locally maximal for Sendov's conjecture if 
and only if p(z) = z 11 + e l9 for some 9 € R. 

To the best of our knowledge, Conjecture [J] has not been stated explicitly in the 
literature. As one can see from the discussion in sections 1 and 2, any approach 
based on local variational methods that were to confirm Conjecture [2] would also 
have to confirm Conjecture [U 

3. Differentiators of normal operators, majorization, and the 



The results known so far on Sendov's conjecture and related questions were 
almost exclusively obtained by analytical arguments. In this section we propose an 
operator theoretical approach to Conjectures [T] and [2] and show that these may be 
viewed as part of the more general problem of describing the relationships between 
the spectra of normal matrices and the spectra of their principal submatrices. We 
also give a geometrical characterization of the critical points of complex polynomials 
by means of multivariate majorization relations. 

3.1. Sendov's conjecture and spectral variations of normal operators and 
their compressions. Let Ti. be a n-dimensional complex Hilbert space with (uni- 
tarily invariant) scalar product (•,■) and identity operator I-h € L(Ti), where L(Tl) 
is the set of all linear operators on H. We may view the spectrum Eig(A) of an 
operator A £ L(7i) as the multiset whose elements are the eigenvalues of A, each 
eigenvalue occurring as many times as its algebraic multiplicity. The spectral radius 
of A is then given by p(A) = max zeEig ( j4 ) \z\. Let JC be a subspace of 7i, A £ L(H) 
and B 6 L(JC). Following [5] (see also [H]), we define the spectral variation s(A,B) 
of A and B to be the directed Hausdorff distance from Eig(A) to Eig(S), that is, 



It was shown in [11, Theorem 1.11] that any A S L(H) has a so-called trace 
vector, i. e., a vector v that satisfies v* A k v — itr(A fc ) for any non-negative integer 
k. It is not difficult to see that if A e L(H) is normal (i. e., AA* — A* A) and 
(ei,...,e„) is an orthonormal basis of Ti consisting of eigenvectors for A then 
v„ := ;^=(ei + • • • + e„) is a trace vector of A. Let P denote the orthoprojection on 

the subspace JC := v^; of Ti and define the P- compression of A to be the operator 
A' = PAP\/c € L(JC) (cf. [4]). Then one can show the following result. 

Proposition 3.1. Let A S L(Ti.) be a normal operator with spectrum Eig(A) = 
{zi, . . . , z n } and let A' be as above. Then 
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max mm 

z6Eig(A) weEig(B) 



Z 



W\. 




for any z ^ Eig(A) . 
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Proof. Let (vi, . . . ,v„_i) be an orthonormal basis of K.. Then the matrix repre- 
sentation of A' in the basis (vi, . . . , v n _i) is given by the (n — 1) X [n — 1) upper 
left-hand principal submatrix of the matrix representation of A in the orthonormal 
basis (vi, . . . , v„_i, v„) of TL. For any z ^ Eig(^4) the (n,n) entry of the matrix 
representation of the (normal) operator (A — zlu)^ 1 in the basis (vi, . . . , v n _i, v„) 
is given on the one hand by 



On the other hand, by Cramer's rule the (n, n) entry of the matrix representation 
of (A — zlji)^ 1 in the basis (vi, . . . , v„_i, v„) of TL is given by the cofactor of the 
(n, n) entry of the matrix representation of (A — zln) in the same basis. Thus 



Remark 3.1. The P-compression A' of the normal operator A is not necessarily 
normal itself. As it was shown in [TTJ Proposition 3.4], this occurs if and only if 
the eigenvalues of A are collinear complex numbers. 

By Proposition 13.11 up to a factor — ^ the characteristic polynomial of the P- 
compression A' of A coincides with the derivative of the characteristic polynomial 
of A. For this reason, the orthoprojection P is called a differentiator of A (cf. [1] 
and [TT] Definition 1.3]). Clearly, for any complex polynomial p(z) of degree n > 2 
there exists a normal operator A £ L(TL) such that the characteristic polynomial 
of A is ( — l) n p(z). Indeed, if p(z) = YYi=i( z — z i) then one may just consider 
an operator whose matrix representation in a given orthonormal basis of TL is 
diag(zi, . . . , z n ). We deduce that Conjectures 1 and 2 can be formulated exclu- 
sively in terms of the spectra of a normal operator and its compression to the 
orthogonal complement of a trace vector: 

Conjecture 5. Let Abe a normal operator on a complex Hilbert space of dimension 
n > 2 and let A' denote the compression of A to the orthogonal complement of a 
trace vector of A. Then s(A, A') < p(A) and equality occurs if and only if either 
A = or A has simple eigenvalues and A n is a non-zero scalar operator. 

Note that by the results of [5] we know that Conjecture [5] is true for normal 
operators on a complex Hilbert space of dimension n < 8. A natural question that 
arises in this context is whether there exists an analogue of Conjecture for the 
directed Hausdorff distance from Eig(A') to Eig(A). The following theorem gives a 
complete answer to the latter question. 

Theorem 3.1. Let A be a normal operator on a complex Hilbert space of dimension 
n > 2 and let A' denote the compression of A to the orthogonal complement of a 
trace vector of A. Then s(A',A) < p(A) and equality occurs if and only iftr(A) = 
and min zeEig(A) \z\ = p(A). 

Theorem 13. II is an immediate consequence of the following more general result: 





((A-zI n )- x v.. 




det(A' - zL K ) 
&et(A-zL n ) ' 



which proves the proposition. 



□ 
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Lemma 3.1. Let M and N be finite point sets in the complex plane such that M 
is contained in the closed convex hull of the points in N . Then 

max min I w — z\ < max I z I 

weM z£N zeN 

and equality occurs if and only if G M and min zS Ar \z\ = max ze N \z\. 

Proof. It is geometrically clear that min Z £jv \w — z\ 2 < max ze pf \z\ 2 — \w\ 2 for any 
w G M. This implies that if =/= w € M then min ze jv \w — z\ < max z6 7v \z\. □ 

A characterization of trace vectors and differentiators of arbitrary operators on 
finite-dimensional Hilbert spaces was given in [TTJ Theorem 2.5]. Using this char- 
acterization we can strengthen Proposition 13. II in the following way: 

Proposition 3.2. Let A S L(H) be a normal operator with spectrum Eig(A) = 
{zi, . . . , z n } and let (ei, . . . , e„) be an orthonormal basis of Ti consisting of eigen- 
vectors for A. Set 

1 - 

Vj = -f= V r? y e,-, 1 < i < n, 

3 = 1 

where i] = exp^ 27r ^~^ ^ . TTien (vi, . . . , v„) is an orthonormal basis ofTi consisting 

of trace vectors for A. Thus, if Pi denotes the orthoprojection on the subspace 
JCi := = C-span{vx, . . . ,Vi, . . . ,v n } of TL and Ai — PiAPi\^ i G LQCi) is the 
Pi -compression of A then 



det(Aj - zI Ki ) = - 
n 



for any z ^ Eig(A). 



n , 



det(A - zI H ) = --4~ [det(A - zI H )} 
n dz 



Proof. It is easy to check that (vi, . . . , v„) is an orthonormal basis of Tl. The same 
computations as in the proof of Proposition 13. II show that one the one hand 



3 = 1 3=1 J 



and on the other hand 

detjA, - zI Ki ) 
det(A - zl n ) 



((A-zIn) 1 v ! ,v l ) 



for 1 < i < n. By [TTJ Theorem 2.5] this is the same as saying that the vectors Vj, 
1 < i < n, are trace vectors of the operator A. □ 



The following consequence of Proposition 13.21 should prove useful for studying 
the geometry of the zeros and critical points of complex polynomials by operator 
theoretical methods. 

Corollary 3.1. Let p be a monic complex polynomial of degree n>2. Then there 
exists a normal n x n complex matrix A with the following properties: 

det(A — zln) = an d det(Arji — zl„„i) = - — p (z), 1 < i < n, 

1 ' n 

where Arji denotes the degeneracy one principal submatrix of A which is obtained 
by deleting the i-th row and i-th column from A. 



24 



JULIUS BORCEA 



Proof. Let p(z) — Jir=i( z ~ Zi ) ano - consider the operator A whose matrix repre- 
sentation in a given orthonormal basis (ei, . . . , e„) of TL is B = diag(zi, . . . , z n ). 
Define the following unitary n x n matrix: 

rfi ( 27T\/ 1 N 

U = (uij), where uu = —=, 1 < i, j < n, and rj = expl 

y/n ' \ n 

and set A = U*BU. Then A is the matrix representation of the operator A in 
the orthonormal basis (vi, . . . , v„) of TL which was constructed in Proposition 13.21 
Moreover, A^j is precisely the matrix representation of the Pi-compression A; of A 
in the orthonormal basis (vi, . . . , v,, . . . , v n ) of v^ - for 1 < i < n. It follows from 
Proposition 13. 21 that A must have the desired properties. □ 



The results that we have just presented make use of the ideas developed by Davis 
in [3] and Pereira in In conjunction with methods of majorization theory, the 
operator theoretical tools introduced in [TT] were the key to Pereira's main results, 
namely some beautiful solutions to the 1947 conjecture of de Bruijn and Springer 
and the 1986 conjecture of Schoenberg. Similar ideas were used by Malamud in [5J, 
where he not only proved these same two conjectures - independently and almost 
at the same time as Pereira - but he also obtained a remarkable generalization of 
the de Bruijn- Springer conjecture (see section 3.2 below). The methods used in 
[6J and [UJ seem to be particularly well suited for studying extremal problems for 
which the loci of the zeros of extremal polynomials are (conjectured to be) lines 
in the complex plane (cf. Remark 13. ip . Notwithstanding, the ideas of [5j and [TJJ 
are a new and powerful approach to the study of the geometry of polynomials. We 
believe that the results and the setting developed in this section should prove useful 
for investigating Conjecture EJ that is, Sendov's conjecture formulated in terms of 
normal operators. 

3.2. Majorization and a characterization of the critical points of complex 
polynomials. Although useful in many contexts, the geometrical information con- 
tained in the Gauss-Lucas theorem is hardly sufficient for dealing with problems 
such as Sendov's conjecture. This is mainly because of the implicit nature of the re- 
lations between the zeros and the critical points, which actually accounts for many 
of the difficulties in studying the geometry of polynomials. Describing these rela- 
tions geometrically and as explicitly as possible would be helpful for a great many 
questions in the analytic theory of polynomials. Indeed, most of these questions 
relate in various ways to the following fundamental problem: 

Problem 2. Let zi, . . . , z n and wi, . . . , w n -i be 2n — 1 points in the complex 
plane, where n > 2. Find necessary and sufficient geometric conditions in order for 
W\, . . . , w n -\ to be the critical points of the polynomial Jl™=i( z ~ 

Naturally, there may be several possible ways of answering Problem [5] depending 
on the context in which one places this problem and the approach that one uses. In 
this section we build on the results of [6] in order to give an answer to Problem [2] in 
terms of majorization, which is a fundamental concept in the theory of inequalities 
as well as matrix analysis and operator theory (see [?])■ 

Let us start with the following definition. 

Definition 3.1. An m x n matrix R = (fy) with m < n is called rectangularly 
stochastic if it satisfies the following conditions: 

n m 

r ij > 0, Tij — 1 and nj — — for 1 < i < m, 1 < j < n. 

j=l »=1 n 
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Note that if m = n then Definition 13 . 1 1 reduces to the usual definition of a doubly 
stochastic (or bistochastic) matrix. In what follows we shall identify an m-tuple 
X = (xi, . . . , Xmf consisting of vectors in R k with an m x k real matrix in the 
obvious manner. 

The following theorem is a fundamental result in the theory of multivariate 
majorization which is originally due to Sherman (|22J). 

Theorem 3.2. Let X — [x\, . . . , x m ) and Y = (j/i, . . . , y n ) be an unordered m- 
tuple and n-tuple, respectively, of vectors in R fe , where n > m and k > 1. Then the 
following conditions are equivalent: 

V™ fix,) V™_ ffyA 
(i) For any convex function f : R k — > R one has — — — < ' 1 



m n 

(ii) There exists a rectangularly stochastic m x n matrix R such that X = RY , 
where X is an m x k matrix and Y is an n x k matrix obtained by some 
(and then any) ordering of the vectors in X and Y . 
If (i) and (ii) are satisfied then we say that X is majorized by Y and write X <Y . 

If X and Y consist of complex numbers then one defines the majorization relation 
X -< Y, when appropriate, by identifying C with R 2 in the above theorem. 

Remark 3.2. If m = n then Birkhoff's theorem (0 Theorem A. 2]) implies that 
the majorization relation -< defines a partial ordering on the set of unordered n- 
tuples of vectors in R fe . The case k = 1 and m = n is originally due to Schur and 
to Hardy, Littlewood and Polya and is often referred to as classical majorization. 
We refer to [7] for further details, including a discussion of the geometrical meaning 
of classical majorization. 

Remark 3.3. Surprisingly, Theorem 13. 21 was long assumed to be an open problem 
and it does not appear in [7], which is the definite reference on majorization theory. 
This may explain why Sherman's result has been rediscovered several times. 

Any geometric description of the critical points of a complex polynomial is rooted 
in, and so it should also reflect, some algebraic characterization of these points. One 
such algebraic characterization is given by the following lemma. 

Lemma 3.2. Let p{z) = 0™=i( 2; — z i) an d ( ?( 2: ) = n rij l =i( z — w j)- The following 
conditions are equivalent: 

(i) q(z) — p'{z) for any z 6 C. 

(ii) For any a € C and 1 < k < n — 1 one has 

77^1) £ IlK- a ) = H Yl II 

\ k ) l<i 1 <...<i k <n—lj=l l<ii<...<i k <nj=l 

(hi) The relations in (ii) are valid for a = 0. 

Proof. Let a G C. Then (i) is obviously equivalent to g("~ 1 ~ fe ) (a) = p( n ~ fc )(a) for 
1 < k < n — 1. The lemma follows from elementary computations. □ 

By combining [6l Theorem 4.6] with Theorem 13.21 and Lemma 13.21 we get the 
following answer to Problem [2] in terms of multivariate majorization relations and 
convex functions: 

Theorem 3.3. Let p(z) = n"=i( z — z i)' l( z ) — n Y\!j=ii z — w j) an d ct € C. For 

1 < k < n—1 define the following unordered (fy-tuple and (J 1 ^ 1 ) -tuple, respectively, 
of complex numbers: 

k 



Z(a,k) = f - a)) 

\7=1 ' 



l<ii <...<i k <n 



2(1 
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l<ii<...<ifc<n— 1 



W(a,k) = 

Then the following conditions are equivalent: 

(i) q(z) = p'(z) for any z £ C. 

(ii) For any a € C and 1 < k < n — 1 one /ias IU(a, k) -< i?(a, fc). 

(iii) For 1 < /c < n - 1 one has W(0, k) -< Z(0, fe). 

(iv) For any convex function f : C — * R, a G C and 1 < k < n — 1 one /ias 

A: ^ / k 

a)) 



1 



it ^ /(llK-«))^ E 

n-1 V j=l l<ii<...<i fc <ii V J = 1 



l<ii<...<i fc < 

(v) The relations in (iv) are valid for a = 0. 

The implication (i) =4> (iv) with fc = 1 and a — was originally conjectured by 
de Bruijn and Springer in [3] and is in itself a powerful generalization of the Gauss- 
Lucas theorem (see also [TTJ Theorem 5.4] for a proof of the de Bruijn- Springer 
conjecture). Unlike classical majorization, the geometrical meaning of multivariate 
majorization - in our case, majorization in R 2 - is not yet fully understood. In view 
of the remarkable achievements of [6] and further clarifying the geometrical 
information encoded in Theorem 13.31 should prove very useful for studying the 
geometry of polynomials in general and Problem [5] and Sendov's conjecture in 
particular. 

3.3. The spectra of principal submatrices of normal matrices. It is inter- 
esting to note that the method used in the proof of Proposition 13.11 leads actually 
to an analogue of the Gauss-Lucas theorem for normal matrices. Indeed, let n > 2 
and let A be a normal n x n complex matrix with spectrum Eig(^4) = {zi, . . . , z n }. 
As in section 3.1, for 1 < i < n we denote by iu the degeneracy one principal 
submatrix of A obtained by deleting the i-th row and i-th column from A. Let I n 
be the n x n identity matrix and (ui, . . . , u„) be the standard basis of C™ viewed 
as a complex Hilbert space with standard scalar product. Now fix an orthonormal 
basis (ei, . . . , e„) of C" consisting of eigenvectors for A. Using the same arguments 
as in the proof of Proposition l3 . 1 1 we get the following theorem of Gauss-Lucas type 
for normal matrices: 

Theorem 3.4. With the above notations one has 

det(A [{ \ - zJ w _i) _ y, |(uj,ej)| 2 
det(A - zl n ) ~ ^ Zj - z 

for z € C\Eig(A) and 1 < i < n. In particular, Eig(A^) is contained in the convex 
hull ofEig(A) for any i e {1, 2, . . . , n}. □ 

Let det(A— zl n ) = OfcLi (^fc — z ) nk i where YlT=i n k = n and the z^s denote the 
distinct eigenvalues of A with multiplicities > 1, 1 < k < m, respectively. Then 
the Zfe's are eigenvalues of each of the matrices j4rji, 1 < i < n, with multiplicities 

(i) 

at least n/. — 1, 1 < k < m, respectively. Thus, there exist complex numbers w) , 
1 < i < n, 1 < j < m — 1, such that 

m— 1 m 

det(A w - zln-x) = [] (™f - z ) Il( Zfe " z )" fc_1 - 
j=i k=i 

By Theorem 13.41 we get 

n ?= i k - zk > 



= nfe|(Ui,e fc )| , 1 < fc < m, 1 < i < n, 
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which proves the following proposition: 
Proposition 3.3. With the above notations one has 



nm—l/ U) 



Zk) 



Uj = l,^k( Z 3 - Z k) 

for all 1 < k < m and 1 < i < n. □ 

Note that if A is Hermitian then (|3.1j) simply means that for each is {1, . . . , n} 
the sequences of real numbers w[ l \ . . . , w^_ 1 and z\, . . . ,z m are interlacing, which 
is the well-known Cauchy-Poincare separation theorem for self-adjoint matrices (cf., 
e. g., 4 , where one may also find a converse to the Cauchy-Poincare theorem due 
to Ky Fan and Pall) . Proposition 13.31 may therefore be viewed as an analogue of 
the Cauchy-Poincare theorem for normal matrices. 

Proposition 13 . 31 is a slightly more general version of the necessity part of Propo- 
sition 3.1 in where a partial converse to this result was obtained. Note though 
that the conditions stated in loc. cit. are actually valid only in the generic case when 
A has simple eigenvalues. Indeed, the arguments used above show that Proposition 
3.1 in [6 should be stated as follows in the general case: 

Proposition 3.4. Let Ai, . . . , A n and fii, . . . , fi n -i be 2n— 1 complex numbers. The 
conditions 



lim 

A— j-Afc 



> 0, 1 < k < n, 



n3U,WAi-A)_ 

are valid if and only if there exists a ( not necessarily unique ) normal n x n matrix 
A such that Eig(A) = {A i; . . . , A„} and Eig(j4[ n ]) = {/j.i, ■ ■ ■ ;Mn-i}- 

Obviously, the critical points of the characteristic polynomial of a normal ma- 
trix satisfy both the "Gauss-Lucas property" (Theorem 13. 4p and - by Lemma |3~21 
-the "interlacing property" (Proposition 13. 3[) . In light of Theorem 13.41 and Propo- 
sition [3~3l we may view the spectra of the deficiency one principal submatrices of a 
normal matrix A as generalizations of the set of critical points of the characteristic 
polynomial of A. As already noted in section 3.1, any monic complex polynomial 
p of degree n may be realized - up to a factor (—1)™ - as the characteristic poly- 
nomial of a normal n x n matrix A. Since A has a differentiator, the polynomial p' 
may itself be realized - up to a factor i — ^ — and possibly also a unitary change 
of orthonormal basis - as the characteristic polynomial of a (not necessarily nor- 
mal) deficiency one principal submatrix of A. We conclude that Problem [5] and 
the study of the geometry of zeros and critical points of complex polynomials - 
including Sendov's conjecture - may be seen as part of the following more general 
problem: 

Problem 3. Let A be a normal n x n complex matrix. Describe the geometrical 
properties and the mutual location of the sets EigLAm) and Eig(A) for 1 < i < n. 

The example A — diag(l, 1, .... 1, —1) and Ar n i = diag(l, 1, . . . , 1) shows that 
the directed Hausdorff distance from Eig(A) to Eig(A n i) can actually be equal 
to the diameter of the convex hull of Eig(A). Thus, the "Gauss-Lucas theorem 
for normal matrices" (Theorem I3.4f> provides in fact an optimal upper bound for 
this directed Hausdorff distance, so that there is no apparent analogue of Sendov's 
conjecture for Problem [3] Nevertheless, in view of Theorem 13.31 it seems natural 
to look for an answer to this problem in terms of (weak) majorization relations 
between eigenvalues (cf. [7]) and inequalities of de Bruijn-Springer type. These 
questions will be addressed in a forthcoming paper. 
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